During a technical livestream at 1 AM today, OpenAI officially launched its latest and most powerful multimodal models: o4-mini and the full-power o3. These models offer unique advantages, capable of processing text, images, and audio simultaneously. They also function as agents, automatically utilizing tools such as web search, image generation, and code parsing. Furthermore, they possess a deep thinking mode, enabling reasoning about images within a chain of thought.